TECHNICAL FIELD

The present invention generally relates to the field of watermarked media
signals and the detection of watermarks embedded in media signals. More particularly, the
present invention is directed towards a method, apparatus and computer program product for
enabling at least identification of data symbols that have been embedded in a media signal by
quantisation of a host signal.



DESCRIPTION OF RELATED ART 
It is well known to watermark media signals in order to protect the rights of
content owners against piracy and fraud.

Two such watermarking schemes are the so-called QIM (Quantisation Index
Modulation) and DC-QIM (Distortion-Compensated Quantisation Index Modulation)
watermarking schemes, where a watermark is embedded in a so-called host signal by
choosing sample values on a sampling grid. In order to get a better statistical resemblance to
the original signal after watermarking, it is then common to add dither to the watermarked
signal.

However, when such a watermarked host signal is transmitted to a receiver,
the scaling factor used, which is proportional to the quantisation step size, can get
rescaled. Since the quantisation step size assumed by the detector is then incorrect, the
detection of a watermark is hard or even impossible to perform. Such rescaling can
take place for various reasons, such as the properties of the transmission channel
used. One reason can be that the channel is an analogue channel with an unknown gain
factor. Another reason can be an attack on the signal by a malicious party.

WO 03/052689 describes one way to retrieve scaling factors from a received,
possibly rescaled host signal. In this document received signal samples are divided into three
different dither value intervals. Histograms are then created using partial probability density
functions for the signal samples in each interval, and the original scaling factor as well as a
DC offset can be obtained by analysing these three histograms. Signal samples with many
different dither values are thus grouped together before analysis, which means that
some information in the signal samples might not be used for determining the scaling factors.
This document also mentions that a pilot signal is used in the retrieval of the scaling factors.

It would however be advantageous to have an alternative way of determining
scaling factors of a received media signal that can use more information in the different
signal samples for the determination of these scaling factors.

SUMMARY OF THE INVENTION 
It is therefore an object of the present invention to provide an alternative way
of determining scaling factors of a received media signal that uses more information in
received signal samples than what is done within the prior art.

According to a first aspect of the present invention, this object is achieved by a
method of enabling at least identification of the presence of data symbols that have been
embedded in a media signal, comprising the steps of:

obtaining a transmitted media signal, which comprises a possibly distorted
version of a host signal in which data symbols have been embedded by quantisation using a
certain quantisation step size and to which dither with a set of dither values has been added,

providing several dither value intervals within the set, where each interval
corresponds to one or a small number of dither values,

determining a histogram for each dither value interval, where a histogram is
determined for all sample values of a set of signal samples of the transmitted media signal
having a dither value in the corresponding dither value interval,

combining the separate histograms corresponding to the intervals into a single
histogram, and

determining at least a rescaling factor based on the combined histogram in
order to estimate the quantisation step size.

According to a second aspect of the present invention, this object is also
achieved by an apparatus for enabling at least identification of the presence of data symbols
that have been embedded in a media signal, comprising:

a signal obtaining unit arranged to obtain a transmitted media signal
comprising a possibly distorted version of a host signal in which data symbols have been
embedded by quantisation using a certain quantisation step size and to which dither with a set
of dither values has been added, and

a signal distortion determining unit arranged to:

provide several dither value intervals within the set, where each interval
corresponds to one or a small number of dither values,

determine a histogram for each dither value interval, where a histogram is
determined for all sample values of a set of signal samples of the transmitted media signal
having a dither value in the corresponding dither value interval,

combine the separate histograms corresponding to the intervals into a single
histogram, and

determine at least a rescaling factor based on the combined histogram in order
to estimate the quantisation step size.

According to a third aspect of the present invention, this object is also
achieved by a computer program product for enabling at least identification of the presence
of data symbols that have been embedded in a media signal, comprising a computer readable
medium having thereon:

computer program code means arranged to make the computer, when said program
is loaded in the computer:

for an obtained transmitted media signal comprising a possibly distorted
version of a host signal in which data symbols have been embedded by quantisation using a
certain quantisation step size and to which dither with a set of dither values has been added,

provide several dither value intervals within the set, where each interval
corresponds to one or a small number of dither values,

determine a histogram for each dither value interval, where a histogram is
determined for all sample values of a set of signal samples of the transmitted media signal
having a dither value in the corresponding dither value interval,

combine the separate histograms corresponding to the intervals into a single
histogram, and

determine at least a rescaling factor based on the combined histogram in order
to estimate the quantisation step size.

Claims 2 and 6 are directed towards also estimating an offset of the
transmitted media signal.

Claims 3 and 7 are directed towards rescaling the transmitted media signal
using the rescaling factor.



WO 2005/064535 






PCT/IB2004/052682 



Claims 4 and 8 are directed towards processing the rescaled signal in order to
detect or extract the embedded data.

The present invention has the advantage of using all the information in the
histograms for the different signal sample values for estimating the quantisation step size.
Because of the large number of histograms it is furthermore easy to find the periodicity
between the histograms, which enables a robust determination of quantisation step size. The
invention is furthermore not dependent on receiving a pilot signal in order to estimate the
quantisation step size.

The essential idea of the invention is that the detector of embedded
information optimally exploits the knowledge it has of the dither values corresponding to
sample positions. In particular it uses its knowledge of which sample positions have the same
dither value. For all sample positions corresponding to a certain dither value, a (sparse)
histogram is constructed. These sparse histograms are then combined into a single
well-behaved histogram from which the relevant scaling parameters can be derived.

These and other aspects of the invention will be apparent from and elucidated
with reference to the embodiments described hereinafter.



BRIEF DESCRIPTION OF THE DRAWINGS 
The present invention will now be explained in more detail in relation to the
enclosed drawings, where

Fig. 1 shows a block schematic of an apparatus according to the present
invention,

Fig. 2 shows a flow chart of a method according to the present invention,

Fig. 3 shows histograms of a received image signal, which histograms have
been calculated with the method according to the present invention,

Fig. 4 shows a curve over a Fourier transform of the histograms in Fig. 3,

Fig. 5 shows a curve over the phase relationship of the transformed histograms
in Fig. 4, and

Fig. 6 shows a computer program product comprising computer program code
for performing the method according to the invention.






DETAILED DESCRIPTION OF EMBODIMENTS 

The invention is directed towards estimating original scaling factors of a
received watermarked signal in order to be able to detect or extract a watermark from a
received host signal. In order to better understand the invention, some basic concepts about
QIM (Quantisation Index Modulation) and DC-QIM (Distortion-Compensated Quantisation
Index Modulation) will now be explained.

According to QIM, a watermark is embedded in a host signal by choosing
quantisation values on a sampling grid. A watermarking formula for this approach can be
written as:

q = (ROUND(s/D - d - b/N) + d + b/N)*D    (1)

where q is the watermarked signal, s is the original signal, D is the quantisation step size, d is
a dither value in an interval 0 <= d < 1, b is the watermark information and N is the number of
information symbols in the watermark signal. In many practical applications N is equal to
two, which indicates a so-called binary alphabet.
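
As a minimal sketch of equation (1), the embedding of a single sample can be written in Python as follows. The function name qim_embed, the example values D = 16 and N = 2, and the round-half-up interpretation of ROUND are assumptions made for illustration only; the text does not specify a tie-breaking rule.

```python
import math

def qim_embed(s, d, b, D=16.0, N=2):
    """Embed symbol b (0..N-1) in sample s according to equation (1).

    ROUND is taken as round-half-up, which is an assumption.
    """
    shift = d + b / N                      # grid offset from dither and symbol
    return (math.floor(s / D - shift + 0.5) + shift) * D

# The same host sample lands on different grids for b = 0 and b = 1.
q0 = qim_embed(103.0, d=0.25, b=0)
q1 = qim_embed(103.0, d=0.25, b=1)
```

In this example the two results are 100 and 108, that is, they differ by D/N = 8, and both stay within D/2 of the host sample.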

The dither is here provided with a wide range of values, which in one
embodiment of the present invention is 128 different values between 0 and 1. The dither
value is a pseudo-random variable that is fixed, independent of the information symbol. The
dither will later be used in order to determine scaling factors, as described below.
The reason dither is provided in the watermarked signal is that it makes the watermarked
signal q virtually indistinguishable from the original signal s, which can be verified by
comparing histograms of the watermarked and original signals.

Above, a QIM watermarking process was described. A variation of this is the
DC-QIM method, where the basic quantisation approach is upgraded with a distortion
compensating component.

A watermarking formula according to this process would then look like:

qdc = s + a*(ROUND(s/D - d - b/N) + d + b/N - s/D)*D    (2)

where a is a properly chosen factor between 0 and 1.
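
Equation (2) moves the host sample only a fraction a of the way towards the QIM grid point of equation (1). The following Python sketch illustrates this under the same assumptions as before (hypothetical function name dc_qim_embed, example values D = 16 and N = 2, ROUND taken as round-half-up).

```python
import math

def dc_qim_embed(s, d, b, a, D=16.0, N=2):
    """Equation (2): distortion-compensated embedding of symbol b in sample s."""
    shift = d + b / N
    q = (math.floor(s / D - shift + 0.5) + shift) * D   # grid point of equation (1)
    return s + a * (q - s)                              # same as equation (2)

q_half = dc_qim_embed(103.0, d=0.25, b=1, a=0.5)   # halfway between s and q
q_full = dc_qim_embed(103.0, d=0.25, b=1, a=1.0)   # a = 1 reduces to plain QIM
q_none = dc_qim_embed(103.0, d=0.25, b=1, a=0.0)   # a = 0 leaves s unchanged
```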

More details about these watermarking schemes can be found in WO
03/052689 and in the article "Quantization Index Modulation: A Class of Provably Good
Methods for Digital Watermarking and Information Embedding" by B. Chen and G.W.
Wornell, IEEE Transactions on Information Theory, Vol. 47, No. 4, May 2001, which
documents are herein incorporated by reference.

When a watermarked signal according to either of these two schemes is then
transmitted to a watermark detector over a channel, it is fairly easy to detect the embedded
message b in a received signal r, which can be performed according to

b' = Mod(ROUND(r*N/D - N*d), N)    (3)

i.e. through dividing the value r by D/N, subtracting the rescaled dither Nd, rounding the
result and performing a Modulo N operation on the rounded value.

Here it is assumed that the received signal r corresponds to the watermarked
signal q with some added noise n.
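
The detection rule of equation (3) can be sketched in Python as shown below. The function name qim_detect, the example values D = 16, N = 2, d = 0.25, and the round-half-up interpretation of ROUND are illustrative assumptions. The sample value 108 used here lies on the grid for b = 1 with these parameters.

```python
import math

def qim_detect(r, d, D=16.0, N=2):
    """Equation (3): estimate the embedded symbol from received sample r."""
    return math.floor(r * N / D - N * d + 0.5) % N

b_clean = qim_detect(108.0, d=0.25)          # sample embedded with b = 1
b_noisy = qim_detect(108.0 + 1.5, d=0.25)    # same sample with added noise n = 1.5
b_other = qim_detect(100.0, d=0.25)          # sample embedded with b = 0
```

Moderate additive noise leaves the decision unchanged in this example, which is the situation assumed in the paragraph above.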

In reality there is not only added noise, but there can also be a rescaling of the
original signal. This rescaling can be the result of properties of the transmission channel used,
for example if it is an analogue channel. Such rescaling can also be the result of an
attack on the watermarked signal by a malicious party. The invention is directed towards
identifying the rescaling factor of a received signal. It is then a simple task to scale the
signal in order to restore the original scaling and thereafter identify or extract a watermark
from the restored signal.

A model for the relationship between the changed signal and the original signal can
then be set as:

r = R*(s + O) + n    (4)

where R is a multiplicative degradation and O represents an offset in DC
values. Through the equation:

r' = r/R - O    (5)

a new signal r' can be determined from which an embedded watermark can be retrieved.
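
The effect of the model in equations (4) and (5) on detection can be illustrated with a short Python sketch. It applies the model to a watermarked sample, and all names (degrade, restore, qim_detect) and numeric values (R = 0.9, O = 1.0, D = 16, N = 2, d = 0.25) are assumptions chosen for illustration; noise is set to zero.

```python
import math

def degrade(q, R, O, n=0.0):
    """Channel model of equation (4)."""
    return R * (q + O) + n

def restore(r, R, O):
    """Inverse of the model, equation (5)."""
    return r / R - O

def qim_detect(r, d, D=16.0, N=2):
    """Equation (3), with ROUND taken as round-half-up (an assumption)."""
    return math.floor(r * N / D - N * d + 0.5) % N

r = degrade(108.0, R=0.9, O=1.0)             # 108.0 carries b = 1 for d = 0.25
b_raw = qim_detect(r, d=0.25)                # detection on the distorted sample
b_fixed = qim_detect(restore(r, R=0.9, O=1.0), d=0.25)
```

In this example detection on the distorted sample returns the wrong symbol, while detection after applying equation (5) with the correct R and O recovers b = 1.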

In order to describe the present invention reference will now be made to Fig.
1, which shows an apparatus for enabling identification of data signals and to Fig. 2, which
shows a flow chart for enabling identification of data signals. The apparatus 10 comprises a
receiving unit 12, which receives a watermarked media signal r over a channel (not shown),
step 22. The channel can here be an analogue channel, in which case the receiving unit can
include radio circuits. The received signal r is then delivered to a signal distortion
determining unit 14, step 24. In the signal distortion determining unit 14, which is the core
element of the invention, a number of further steps are then performed, which will be
described next.

Under the assumption that the received signal r is made up of a number of
samples that can have values in a value range, and also under the assumption that the dither
intervals are known at the watermark detector side, the signal distortion determining unit 14
first sets up a large number of dither intervals, for example 128 different intervals that the
dither can fall into, step 26. Thereafter the signal sample values of the received signal are
grouped according to dither interval, that is, all signal sample values having dither within a
dither interval are grouped together. Each dither value interval is here based on only one dither
value. In this way a number of different sample groups are provided, one for each dither
interval, step 28. A histogram is then computed from the signal samples of each dither
interval, step 30. These
histograms are then combined into a single histogram, step 31. Thereafter the rescaling factor
1/R as well as the offset O are estimated based on this combined histogram, step 32. How this
can be done will shortly be described in more detail. At this point it suffices to say that the
(typically sparse) separate histograms are combined into a single (better behaved) histogram
and that the relevant parameters are derived therefrom. The determined offset -O is then
provided from the signal distortion determining unit 14 to an adding unit 16, which also
receives the signal r. The adding unit 16 then adds the offset -O to the signal r and thus
subtracts the offset O from the received signal, step 34, and provides the thus adjusted
received signal to a multiplying unit 18. After the rescaling factor 1/R has been estimated by
the signal distortion determining unit 14, this factor is also forwarded to the multiplying unit
18. The rescaling factor 1/R is then multiplied with the adjusted received signal in order to
scale the signal, step 36, and thereby retrieve the original level or original quantisation step
size of the watermarked signal. The thus rescaled signal r' is then supplied to a watermark
detector 20, where watermark detection is performed, step 38.
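
Steps 26 to 30 can be sketched in Python as follows. The sketch assumes, as the text does, that the detector knows the dither index of every sample position; the function name dither_histograms, the rounding of sample values to the nearest integer level and the toy input are illustrative assumptions.

```python
import math

def dither_histograms(samples, dither_index, K, M):
    """Return K histograms (one per dither interval), each with M bins."""
    hist = [[0] * M for _ in range(K)]
    for r, k in zip(samples, dither_index):
        n = math.floor(r + 0.5)            # nearest integer signal level
        if 0 <= n < M:                     # ignore out-of-range samples
            hist[k][n] += 1
    return hist

# Toy input: four samples, two dither intervals, eight signal levels.
h = dither_histograms([3.0, 3.2, 7.0, 3.0], [0, 0, 1, 1], K=2, M=8)
```

With 128 dither intervals and 256 signal levels the result corresponds to a 128-column matrix of histograms of the kind discussed below for Fig. 3, with each inner list forming one column.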

How the scale factors can be determined will now be described in relation to
Fig. 3, which shows a matrix of a number of histograms made according to the principles of
the invention for an image having 256 signal levels and 128 dither value intervals with D set
to 16 and N set to 2. The histograms thus occupy a column each. As can be seen, the
combined histograms together provide a number of lines that run from left to right and that
are slightly angled upwards. The angle is due to the fact that the dither raises the
level of the signal. Each histogram thus contributes to these lines. By studying the angle by
which a line is raised, it is possible to determine the scale factor of the received signal and
from that rescale the signal. In the figure a Hanning window has been applied in the vertical
direction, which explains the darkening at the top and bottom of the matrix. From index equal
to 1 to index equal to 128 there is a phase shift of ND = 2D. From this figure it can be seen that
the multiplicative degradation is about 0.95. For each dither interval there is furthermore a
clear periodicity.

Mathematically the histogram matrix of Fig. 3 can be described as:

h(k,n) = h(n/R - (k/K + O)*D)    (6)

where h(k,n) is the frequency of sample value n for dither interval k, K is the
number of dither intervals, O is the offset and R is the rescaling factor. As each column in the
matrix is a shifted version of any other column, and as the periodicity of the signal is most
easily retrieved in the frequency domain, an FFT (Fast Fourier Transform) is applied along the
vertical axis n. Before that, a window function w(n) is applied in order to reduce wrap-around
effects.

The application of a window function leads to the following equation:

h'(k,n) = w(n)*h(k,n)    (7)

where w(n) is assumed to be sufficiently fast decaying to approximately have h'(k+m,n) =
h'(k,n - m*D/K), i.e. shift variance after windowing. Then the vertical Fourier transform is
applied according to:

H'(k,v) = Sum_n h'(k,n) exp(-2*π*j*n*v/M)    (8)
        = exp(-2*π*j*(k/K + O)*D*R*v/M) H'(v)

where the size of the set of signal values is M. For each column in Fig. 3, it can therefore be
seen that the Fourier transform differs only in phase and not in amplitude. An estimate of
|H'(v)| of the combined histogram h(n) (laying all separate histograms on top of each other)
can be obtained by averaging |H'(k,v)| over the variable k according to:

|H'(v)| ~ (Sum_k |H'(k,v)|)/K    (9)

This is outlined in Fig. 4, which shows |H'(v)| plotted against v. From this
figure it can be seen that |H'(v)| has strong peaks, corresponding to the periodic behaviour
of the original histograms, where the distance between the peaks has a period of RD. The first
strong peak in the figure corresponds to the fundamental period RD, which in the frequency
domain is the fundamental frequency M*N/(R*D). In the figure this frequency is estimated as
33, from which a factor R = 0.96 can be determined.
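
The averaging of equation (9) and the step from the fundamental frequency to R can be sketched in Python as below. This is an illustration under stated assumptions, not the implementation of the invention: a plain discrete Fourier transform is used instead of an FFT, no window function is applied, the peak is picked as the strongest component below M/2 rather than the "first strong peak", and all function names and the toy columns are hypothetical.

```python
import cmath

def dft(x):
    """Plain O(M^2) discrete Fourier transform (an FFT gives the same values)."""
    M = len(x)
    return [sum(x[n] * cmath.exp(-2j * cmath.pi * n * v / M) for n in range(M))
            for v in range(M)]

def mean_magnitude(columns):
    """Equation (9): average |H'(k,v)| over the dither index k."""
    spectra = [dft(c) for c in columns]
    return [sum(abs(S[v]) for S in spectra) / len(spectra)
            for v in range(len(columns[0]))]

def rescaling_from_peak(v0, M, N, D):
    """Solve v0 = M*N/(R*D) for R."""
    return M * N / (v0 * D)

# Toy data: two columns with period 4 in M = 8 levels, one shifted by one level.
cols = [[1, 0, 0, 0, 1, 0, 0, 0], [0, 1, 0, 0, 0, 1, 0, 0]]
mag = mean_magnitude(cols)
v0 = max(range(1, len(mag) // 2), key=lambda v: mag[v])   # strongest peak below M/2
R_toy = rescaling_from_peak(v0, M=8, N=2, D=8)
R_text = rescaling_from_peak(33, M=256, N=2, D=16)
```

The shifted column has the same magnitude spectrum as the unshifted one, so averaging does not smear the peak. With the rounded peak position 33 and M taken as 256 (the number of signal levels given for Fig. 3), the formula evaluates to roughly 0.97; the value 0.96 quoted above would correspond to a peak position slightly above 33.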

This fundamental frequency can also be used for determining the offset O,
since the phase behaviour of this fundamental frequency (as a function of k) is given by:

Phase(k)/(2*π) = -(k/K + O)*N    (10)

In Fig. 5 this phase relationship is plotted as well as the best linear
approximation (straight line). It can clearly be seen that from minimal to maximal dither
index two full cycles are covered. The offset value is estimated at 0.17*D.
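
One way to obtain the offset from equation (10) is to unwrap the measured phases, fit a straight line over k and read O from the intercept. The Python sketch below does this with an ordinary least-squares fit; the fitting method, the function name offset_from_phase and the synthetic test data (K = 16, N = 2, O = 0.17) are assumptions, since the text only speaks of a best linear approximation. The result is in units of D and is only determined modulo 1/N.

```python
import cmath
import math

def offset_from_phase(phases, N):
    """Fit Phase(k)/(2*pi) = -(k/K + O)*N with a straight line and return O."""
    # Unwrap so that consecutive phases differ by less than pi.
    y = [phases[0]]
    for p in phases[1:]:
        step = p - y[-1]
        step -= 2 * math.pi * round(step / (2 * math.pi))
        y.append(y[-1] + step)
    y = [p / (2 * math.pi) for p in y]       # phase in cycles
    K = len(y)
    k_mean = (K - 1) / 2
    y_mean = sum(y) / K
    slope = (sum((k - k_mean) * (y[k] - y_mean) for k in range(K))
             / sum((k - k_mean) ** 2 for k in range(K)))
    intercept = y_mean - slope * k_mean      # equals -O*N in equation (10)
    return -intercept / N

# Synthetic check: wrapped phases generated from equation (10) with O = 0.17.
K, N, O = 16, 2, 0.17
phases = [cmath.phase(cmath.exp(-2j * math.pi * (k / K + O) * N)) for k in range(K)]
O_est = offset_from_phase(phases, N)
```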

The different units in the apparatus can be provided in either software or
hardware. In the case of software, the signal distortion determining unit, possibly together
with the adding and multiplying units, can be provided in the form of one or more processors
together with corresponding program memory for containing the program code for
performing the method according to the invention. The program code can also be provided on
a computer program product, of which one is shown in Fig. 6 in the form of a CD ROM disc
40. This is just an example and various other types of computer program products are
equally feasible. The program code can furthermore be downloaded from a server, perhaps via
the Internet.

With the present invention there are several advantages obtained. All the
information in the histograms for the different signal sample values is used for estimating the
quantisation step size. Because of the large number of histograms it is furthermore easy to
find the periodicity between the histograms, which enables a robust determination of
quantisation step size as well as offset. The invention is furthermore not dependent on
receiving a pilot signal in order to estimate the quantisation step size. Moreover, the method
is also easily extended to situations where the scaling factor is not constant over the range of
sample values.

There are several variations that can be made to the present invention. The
media signal need not be received in the apparatus. Instead the apparatus can fetch the signal
from a memory. The apparatus need furthermore not include the watermark detector. This
detection can be performed in another apparatus. The number of dither values used can also
be varied and is in no way limited to 128. Therefore the present invention is only to be
limited by the following claims.



